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FIGURE 1A 

Frame3 lcsllfglslsitmf emm p 

»2 V M * S F V W S L P * H M D V * D D A I 

Framel CYVV FCLVSPLA*RCLR*CH 

DNA TGTTATGTAGTCTTTTGTTTGGTCTCrCC^ 

Frame3 " " f" * i" "h* ' f" "c" "c" "*" "a" "a" "a" " e° " y cwnpsl.fi 
Frame2 h sflllsscrvllesqfihw 

Framel S F IFVAEQLPSIVGIPVYSL 

DNA TCATT ^ hCV26544776 ^SNPG98~'al662 833 

Framed g"f"c' V S 's"*"*' "t *C G "f* "l" 'q 'i' *g" 'f* V "l N 

Frame2 FLCLQLIDMWIPPV RVCY** 

Framel V S VSP VDRHVDSSS*GLLLM 

DNA GTTTCTGTGTCTCCAGTTGATAGACATGTGGATTCCTCCAGTTAGGGTTTG^A^AATG 180 

I SNPG168 AL662833 

Fraiei' "e "a "f "i" ^"n* "c "l" "q" "v" 'w "t "y "i > I S F G * 
Frame2 S H Y K * L L T S V D L H F Y F F W I N 

Framel KPL*ITAYKCGLT F L F L L D K 

DNA AAGCCACTATAAATAACTGCTTACAAGTGTGGACTTACATTTTTA I I I CI I I I GGATAAA 240 

R hCV27464285 rpt 

| SNPG205 AL67188 

Fram43""i'"R"'i''c'"G"4''A"G"'p"'c''G"'N"'R" M W V "e" 't* " 

Frame2 TYLWNCWAMW* - MGNCIRNC 
FrSSl Y V F V EL LGHVVIDG* LYKKL 
DNA TACGTATTTGTGGAATTGCTGGGCCATGTGGTAATAGATGGGTAACTGTATAAGAAACT 

| SNPG298 
AL662833 

Frame2 HTTLQIGC .HIFCIPTSN IRH 
Frtmel P Y H F T N W L P H F L H S Y Q Q Y Q T 
DNA CCATACCACTTTACAAATTGGCTGC(^CATTTTTTGCATTCCTACCAGCAATAT^ 360 

hCV2 7464284 K rpt 

| SNPG354 

AL67188 

Y hCV27465835 rpt 
| SNPG333 AL67188 

Frame2 S Y F F H I L A S V K T Y H M S F * L Y 

Framel FLFFPYSCQC*DLSYV F L T L 

DNA TTCCTATTTTTTCCATATTCTTGCCAGTGTTAAGACTTATCATATGTC I I I I I AACTTTA 420 

Fram43'T'c' V V"*' V V V VV V 'f'^ \ h f f d 

Frame2 L L * V M C D G F S L W F * L A L L * * 
Framel SALGDV*WFLIVVLTCT5 LM 
DNA TCTGCTCTAGGTGATGTGTGATGGTTTCTCATTGTGGTTTTAACTTGCACI I CI I IGATG 480 

Y hCV27466112 rpt 

| SNPG421 AL67188 w SNpG476 

AL67188 
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FIGURE IB 

Frame3 D*YCLLSFHVHLSDLLHIFY 
Frtme2 LVLFAIFSCSSKRLITYIL* 
cramAl T s I V C Y L FM FI * ATYY IY FM 
K mel ACTAGTATTGTTTGCTATCrrTTCAT^CATCTA^ 540 

| SNPG518 AL67188 

AL67188I SNPG539 

Frames' 'e'"L'>''^"k"'f''n" "6" '**"f""q'r"'l^ 

Frame2 T I L Q I Q * L ^ ^ ^ _ _ _ p p * S r V F F 

cr-amel NYFANSMINSRDFFRIP C F 

DNA AACTATTTTGCAAATTCAATGATTAATTCCAGAGACTTTTTCAGAATTCCCTAGTGTrTT 600 

Framed VVVT'l^VVVVTT VTVV'f'l s Y P 
Frame2 Y I Y N E V G D K E R L S F L P F L S I 
Framel JHIQ*SW*QRKTFISSFLIH 
DNA CTACATATACAATGAAGTTGGTGACAAAGAAAGACTTTCATTTCTTCCTTTCTTATCCAT 660 

| Y SNPG618 hCV26544780AL662833 AL67188 

Frame3 L I F F L L K L L LFGRDEVSLIR 
Frame! D L F S F K I I I I W * R * G L T Y Q A 
Framel *SFFF*NYYYLVEMRSHLSG 
DNA TGATCTTTTTTCTTTT A A AATTATTATT ATTTGGTAG AG ATG AGGTCT CACTT ATCAG G C 720 

ISNPG686 
| SNPG667 hCV27463682 

I SNPG710 



Frame3° "l""v""s""n""s""*""sq v i l p p Q p p k m q g 
Frame2 GLKLLISSDPPTSASQNA G I 
Framel W S Q T PDLK*SSH LS L PKCRD 
DNA TGGTCTCAAACTCCTGATCTCAAGTGATCCTCCCACCTCAGCCTCCCAAAATGCAGGGAT 780 

Fram43'"L""Q''A""*'"A"'T"'M""p'"G*'p'"c'"c''T'"G''* C * V 

Frame2 TGM S H HAWS LLHWLG* L L G V 
Framel Y R H E P PCLVLVALVRMTVRC 

DNA tacaggcatgagccaccatgcctggtccttgttgcactggttag<^ 840 

Frame2 * T R M M R A H M F V Y K E L K Q I Y K 
Framel LNKNDESSHVCLQGT*TNLQ 
DNA TTAAACAAGAATGATGAGAGCTCACATGTTTGTTTACAAGGAACTTAAACAAATTTACAA 900 

LF1.1 ATGATGAGAGCTCACATGTTTGTTTACAA 

LF2 . 10 GCTCACATGTTTGTTTACAAGGAACTTAAA 
I SNPG856 

Frame3" "r" "k" " k" " p" "i" " p" " i " ' k" ' k' * w* "a" "k" 'd i n r h f s e 

Frame2 K K T H P H Q K g V G K r G ^ ^ ^ ^ g L q R r G 
DNA™ 61 GAAAAAAACCCATCCC^T^AAAAGTGGGCAAAGGATATAAACAGACACTTCTCAGAGG 960 
I SNPG902 INS G DNA14AL1 

| SNPG908 INS A DNAP5AL1 

| SNP930 Pan troglod. A 
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FIGURE 1C 

FrameB E D I Y V A K K H M K K S S H T Y M K R 

FrSSl K T F t W p R N I * K kahtri nv begin 
of homoloav orfl to tre homologue <R154 ^.^.^ -imn 

I SNP1006 

I SNP1018 

I SNPG972 

| R SNPG975 

Frames" '6' ^''L'"*''s''y''p''k"T"n'' ; "'f'"q'"a" n ^ 

F rame2 LFIILSKKELISSNSSIT 5 I 
Frtmel T V Y N PI QKRTD FKQQQYY FH 
DNA ACTGTTTATAATCCTATCCAAAAAAGAACTGATTTCAAGCAACAGCAGTATTACTTCCAT 1080 

- I SNPG io48 

| SNPG1056 

| SNP1069 

| SNP1073 

| SNP1079 

Framed" f"n"t 'w V "c" "k "h' *Q V "s" "h" V "r' 'n" * T T L * 
Frame2 Q Y L D L Q T S K K P L E K L N D S L K 
cramel STLGPANIKKATGETERLSt 
DNA TCAATACTTGGACCTGCAAACATCAAAAAAGCCACTGGAGAAACTGAACGACTCTCTGAA 1140 

fp^3'"k"'p""*'T"k"4""*"'r""s'"*"n''l"'e""k"t'"*'"g k g q 

Frlme2 A L N * D M K K L K S G K N L R K R T G 
- -i qi kLRYEEVEIWKKLEEivLJK 
dna AGCCTTAAACTAAGATATGAAGAAGTTGAAATCTGGAAAAAACTTGAGGAAAAGGACAGG 1200 

R 

hCVl5819424 ISNPG1199 

I SNPG1181 

I SNPG1154 

Frames" "a" "g" "g" "s" *t" "v" "a" "t" "t* "k" "k" "a" "g" N R K R G W Q 
Frlmel R G K H S G Y N K K r G R K Q E r E R H f f 

DNA mel cagggggLgc^cXgtggctacaacaaaaaaggcaggaaacaggaag^ 1260 

I SNPG1236 

Frame3 " ' H* V* g' "*" " R* * F* " F* ' G* "d" c" ° I " *g' " f" Q R Q N P K E 
Prime! C W L K V L W R L y Y ^ ^ $ P ^ ^ ^ ^ q K R $ A 

DNa" 161 ATGTTGGCTAA^GGTTCTTTGGAGATTGTATTGGATTCCAAAGACAAAACCCAAA^ 1320 
| SNPG1262 
| R SNPG1263 

| SNPG1274 ( 

SNPG1319 

Frame3'"a''w '*''k'"e'"*'"k'"m';''d"'q 'r 

Frame2 M V K R M K N V ^R _P f R ^ ^ Q ^ * ^ E N 
DNA 1 " 61 AATGGTGAAAAGAATGA^A^TGTGAGACCAAAGAGAAAGGAGCAATCACAGCAAAGGAA 1380 

I R SNPG1334 
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FIGURE ID 

Y N G C S K N 
* W M L K E 

c'ramol i ' v' T* M M m" d~ K N I S L I I M D A Q R M 
DNA CTATACACAATGATGATGGATAAAAACATCAGCTTGATTATAATGGATGCTCAAAGAATG 1440 

| SNP14 SNPG1426 



TIHNDDG*KHQ L D 

**WTKTSA*L*WMLKEC 



Y T Q 



hCV27504712 



Fram4§' VV*L S G F L Y F T F S Q C S * K S H Q 
Frame2 R I I R I P V FY I L S V F L K K P 5 V 
FramPl ODYQDSCILHSLSVPEKAIS 
DNA CAGGATTATC^GGATTCCTGTATTTTACATTCTCTCAGTGTTCCTGAAAAAGC 1500 

K hCVll691030 IR SNPG1489 

hCVl6 ° 30281 |Y SNPG1466 



SNPG1444 



| SNPG1487 SNP13 

| SNPG1495 T 



Pan trog. 



Frame 3 
Frame2 
Framel 

DNA 



Frame 3 
Frame2 
Framel 
DNA 



s"r"s"h"c *ld*stppr*fyrym 

Q E S L L A G L K H T S Q M T L * I H G 
P GVTASWI EAH L PDDS IDTW 
CCAGGAGTCACTGCTAGCTGGATTGAAGCACACCTCCCAGATGATTCTATAGATACATGG 15 60 

|SNPG1518 
| SNPG1505 hCVl5819434 
| Y SNPG1508 

' 1 SNPG1521 ISNPG1554 

| R SNPG1543 SNPll 
EEEGECGVYGTS* LV* FCKR 
RRGGMWSIWYFLTGLV L Q K I 
K K RGNVEYMVL L DWFS SAKD 
AAGAAGAGGGGGAATGTGGAGTATATGGTACTTCTTGACTGGTTTAGTTCTGCAAAAGAT 1620 

| SNPG1612 C Pan trogl . 



Frame 3 
Frame 2 
Framel 
DNA 



FTDWN NT LAS E RCT F Q v G K 

YRLEQHSGI*KMHFSSGKVK 

L OIGT TLWHLKDA LF KWESK 

TTACAGATTGGAACAACACTCTGGCATCTGAAAGATGCACTTTTCAAGTGGGAAAGTAAA 1680 

ISNPG1630 hCVl5819435 



ISNPG1680 



ISNPG1638 hCVl5819436 



K T G 



Fr^me3"N""c' P V Q W A L G L W F * R E A I 
Frame2 L S C A M G L G P L V L E G G Y K N W F 
Pram ai TVLCNGPWAFG FRGRL*KLV 
DNA ACTGTCCTGTGCAATGGGCCTTGGGCCTTTGGTTTTAGAGGGAGGCTATAAAAACTGGTT 1740 

|SNPGDELC17070 

|SNPG1687 
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FIGURE IE 



. ccatpSIQQMLRSLHPHNTR 
FrSSSl L C Y S Q Y T T N A K V T P P P Q H Q N 

CCTTTGCTATTCC^GTATACAACAA^TGCTAAGCTCA 1800 
| SNPG1742 



i m 'u"y'k"c"i"k"U"\l"l LLIPHWKNQFL 

FrSSSi E E L S I S L D F T Y P S L E E S I P S 

K? 1 tgaag^g^ctItct^^ 1860 



DNA I SNPG1810 Y 

ISNPG1813 



SNPG1841 



3 , m i i prCHLHL*KWMKT*N* 
Frame3 L N U L P R *. n l n u z 
Frame2 K P A A E M P P P P „ _ g * ^ H R j; D 

TAAACCTGCTGCCG^GATGCCACCTCCACCTATAAAA 1920 

| SNPG1895 hCVl6030290 

| SNPG1874 

I SNPG1875 lsNpig07 

ISNPG1909 



c^moi" "i" *v" "t"k" "*"v " i" M " i K M R G Q D H * I Y Q 
Frt: e e2 S V D Q I S V jD # D Q N ^E R T r G ^ ^ £ N y I ^ 1 

DNA 1 " 61 aaVtgatIaaaVa^ 1980 



ISNPG1957 

SNPG1921 

I SNPG1922 |SNPG1959 



FrameS F Q L N Q L L L L N L M F H P S F S Q C 
Frame2 P V E S V A A S J< S V p S V ^ h S A S A 

Framel 5 S 3AATCAGTTGCTGCTTCTAAATCTGATGTTTCACCCATCATTCAGCCAGTGCC 2040 



ISNPG1976 

• I 5 s C C A F 3 z ' C " F" T H" h" A S / 

DNA TCCAGTTG/ 

| SNPG1993 

Frame2 S I K N v k u j. # u # ^ ^ Q g Q x A * 

S5T 1 tIgcataaagLtgttc^gattgatcatactaaaaaactggcagtca^ 
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FIGURE IF 



N S L L R M K 



Pra m e3 K S I * S N L K V Q ^ ^ £ S q q g p q n k 

*KYKS*ATVSSE*K 



Frame2 E H I I K 

Sna" 161 A^G^WaXtCAAATCT^ 2160 



SNPG2110 del AA P. trogl 



| EXON 3A IN P7N 



Frame3 
Frame2 
Framel 
DNA 



K L F L I 

V I P D C S 
S Y S * L F H 



v'pPSQ*FPLQLSC*Q 

TKPVVSSPTLMLTD 
*LFHQASSFLSNSHVNR 
AGTTATTCCTGATTGTTCCACCAAGCCAGTAGTTTCCTCTCCAACTCTCATGTTAACAGA 

ISNPG2180 hCVl6030297 

ISNPG2192 , j 

ISNPG2193 G Pan trogl od. 



2220 



Frame3 
Frame2 
Framel 

DNA 



M 



KKRLIFMQKLLF 
EEKAHIHAE T ^A 

TG AAG AAAAGG CTCATATTCATGCAG AAACTGCTCTT CTAATGGAGAAAAACA AACAAGA 



W R K T N K 
M E K N K Q E 
N G E K Q T R 

2280 
5NPG2275 



ISNPG2230 



VrlZl * / e" L F q" a" R° Q S Q K 0 Q k" q" k" E " itJS Aft of hLonogy 
orf3 T E E G R T R 

DNA 1 " 61 aLaG^ACTTCAGG^GACAG^^ 2340 



Frame3"E" A Q ^aV ^ ^ ^ r E & a ^e r E ^ # e x t ^ K Q q ^ 
oS mel JcaIwwgcILu^^ 2400 

SNP | SNPG2375 

Frame3 ' " k" "a" "k" _e' ^e" ^m'^e" ^k" ^k" ^e'^r" ^ 

Frame2 S K R R N G E E R T I u p u K R ! K k 

DNA AAGCAAAAGAAGA^TGGAG^AGAAAGAACGTGAAC^GGCCAAGAAAGAGGATAAA 2460 

hCVl6030298 

Frame2 lskegqrnnks^kki k vi m 

SnI" 161 tItcagc^ILaEg^^ 2520 
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FIGURE 1G 

K=S E „ T L S w G c A R E E K 

DNA mel A^CCTCTGGTGCCGA^AGTCTGTAGAGGAC^GGGGAGAAGATGTTCAACCCCAGAAG 2580 

<EXE1626 2_not in Lucy 

<EXEXb3 - L — <EXE1660__ 



_R2533 



ISNPG2525 

FrameS V q" K ' K S t"r V V V 'h' TV A T* "g" '_0 '_S G J 



hramea _ „ v ,j ■/ o r V P Y I C D R G F R F R 

S y R K S n Q E M C P I H L R Q G I Q V Q 
lm TACAGAAAAAGTCAACAAGAGATGTGTCCCATACATCTGCGACAGGGGATTCAGGTTCAG 



2640 



ISNPG2618 R 
ISNPG2619 Y 
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FIGURE 2 

SEO ID NO -3 58 DFKQQQYYFHSILGPJ^IKKATGETEW.SESLKI^YEEVEIWKKLEEKDRQGEAQWLQQK 

SEO ID NO - 4 78 DFKQQQDYPHSILGPGNIKKAVEEAERLSESLKLRYEEAEVRKKLEEKDRQEEAQRLQQK 

SEQ ID NO. 4 /» ^ff^^^^^^ «*.** * ************* * ********* *** **** 

SEO ID NO -3 118 RQETGREDGSMI^GSLEIVLDSKDKTQKSNGEKNEKCETKEKGAITAKELYTJJO©^I 

tlo id no -4 138 rqotgredggti^gslenvi^skdktqksngekne^^ 

SEQ ID XJO *uo*«~^ ******* ************************************ **** 

SEO ID NO -3 178 SLII^AQRMQDYQDSCILHSLSVPEKAISPGOTASWIEAHLPDDSIDTWKIWGNVEYMV 

tlo ID NO* 4 198 SLI^ARPJ^QDYQDSCILHSLSVPEEAISPGVTASWIEAHLPDDSKDTWKKRGNVEYVV 

SEQ ID NU.4 x»o ******* ****************** ******************* *********** * 

SEO ID NO -3 238 LLDWFSSAKDLQIGTTLWHLKDALFKWE KGGYKNWFLCYSQYTTOAKV 

tit ID NO: 4 258 LLDWFSSAKDLQIGTTLRSLKDALFKWESKTVLRNEPLVLEGGYENWLLCYPQOT 

^ ***************** ********* *** ** *** ******** 

SEO ID NO -3 2 86 TPPPQHQNEELSISLDFTYPSLESSIPSKPAAEMPPPPIKVDEDIBLISDQISDNDQNER 

tlo ID NO-4 318 TPPPRRQNEEVSI SLDFTY PSLEES I PSKPAAQTPPAS IEVDENIELISGQ NER 

SEQ ID NU.« JJ-o JL*r*£ ********************* ** * *** ***** * *** 

qpo ID NO-3 346 TG PLNI S I P VE S VAASKSDVS P 1 1 QPVP S I KNVPQI DHTKKL AVKL PEEH 1 1 KSE STNHE 

K £ NO; 3 4 3 37 2 MGPLNISTPVEP^ 

SEO ID NO ■ 3 406 QQSPQNEKVIPDCSTKPWSSPTLMLTDEEKAHIHAETALLMEKNKQEKELQERQQGKQK 

K g NO; 3 4 432 SoTO™™^^ 

SEQ ID NO: 3 466 E 

SEQ ID NO:4 492 E 
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FIGURE 3 



rhodanese 
SEQ ID NO: 3 
rhodanese 
SEQ ID NO: 3 
rhodanese 
SEQ ID NO: 3 



*->tagelkalles . apkliliDvRspefGeryeyegGHIpgAvNvplee 
ta +el+ ++ +++ +li++D+ + + Y+ ++I ++ vp e 

164 TAKELYTMMMDkNI SL 1 1 MDAQRM QDYQDSCILHSLSVP-EK 204 

eiealldrsgilpdieklhllkdpeelaklfgelgsskdkrvivycrsgr 
+i+ +++s+i++ hl++d +++k+ g+ + + + +s + 

205 AISPGVTASWIEA HLPDDS IDTWKKRGNVEY MVLLDWFS S AK 246 

dllrnrrsalaalllkklgypeVyiLkGGykeWlak<-* 
+ + ++ ++lk + kGGyk+W+ + 

247 DLQIGTTLWHLKDAL FKWEKGGYKNWFLC 275 
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FIGURE 4 



O CO I s - 
CO lO N- v 3~ 

D O Q O O O 
■ □□!§□ 




1077- Z : °» 8J P e *° ejJO ° 

ui passojdxe s|eAei uoissaidX3 
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FIGURE 5 





stop 



ex1-ex2-ex3 



SEQ ID NO : 52 




ex1 -int1 -ex2-ex3 seq id no : 54 



ex1 -ex2-ex3A seq id no : 55 
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FIGURE 6 



SEQ ID N C:3 ^SSS^ 

sIq S SB =5*1™ 

SEQ ID N 0:3 «»»^^^^^^^^SS^SSSSS 

SEQ ID NO:3 TGREDGS^SLBI^S^^ 
SEQ ID NO: 53 
SEQ ID NO: 56 

s S SS-.ia SSSSSSSSSSSBSSBSSS^^ 

IIq ID NO: 56 WFSSAKDLQIGTTLVffiLKDALFKWEK- 

SHU ************************** 

SEQ ID NO: 56 

SEQ ID NO : 56 ************ : 




SBa ZD »o,3 — ssissssassoss 



SEQ ID NO:3 RS--- 
SEQ ID NO: 53 RSCRK 
SEQ ID NO: 56 RSCRK 



** 



